Wideband Stereo Speech Coding for Teleconferencing Applications
نویسندگان
چکیده
Almost a ll voice communications are based on monophonic narrowband speech. Wideband stereophonic communications provide a more natural sounding environment. This holds especially true for teleconferencing applications, where the localization information in the stereo signal adds a new dimension to the communication. As of today, there is no standardized speech codec with full stereo support. Statistical analysis shows that there exists correlation between left and right channels of a stereo speech signal. Instead of coding both channels independently (2 x mono bit-rate) stereo parameters are extracted which, combined with the mono signal, allows for low bit rate stereo reproduction. This thesis report describes a parametric stereo coding algorithm. The algorithm, which is targeted at conversational applications such as teleconferencing, reproduces the stereo signal from a down-mixed mono signal and additional stereo parameters. During the thesis work, different stereo coding methods have been studied and evaluated. Given design constraints such as low complexity, low delay and low bit rate, the stereo coding algorithm was based on adaptive inter-channel prediction techniques. In the thesis, novel solutions to inherent problems of the inter-channel prediction framework are presented. A parametric stereo extension, based on the new findings, has been implemented into the standardized Adaptive Multirate Wideband speech codec. Although the bit-rate cost for the stereo extensions is as low as 1 kbps, MUSHRA tests show that the new codec is able to reproduce a high-quality stereo speech signal.
منابع مشابه
Wideband Audio
This chapter covers key technologies in wideband audio coding including auditory masking, perceptual coding, frequency domain coding, and dynamic bit allocation. The MPEG standardization work is then described. MPEG algorithms have found a wide range of communications-based and storage-based applications. For example, European digital audio broadcast (DAB) makes use of MPEG-1. It will then be s...
متن کاملDigital Audio for Multimedia
The paper covers key technologies in wideband audio coding including auditory masking, perceptual coding, frequency domain coding, and dynamic bit allocation. The MPEG standardization work is then described. MPEG algorithms have found a wide range of communications-based and storage-based applications. For example, the European digital audio broadcast (DAB) makes use of MPEG -1. It will then be...
متن کاملFloating-point Adaptive M Speech Cod
The Adaptive Multi-Rate Wideband (AMR-WB) speech codec algorithm has been selected for wideband speech coding in wireless and wireline services by both 3GPP and ITU-T. This paper describes an implementation of floating point Adaptive Multi-Rate Wideband (AMR-WB) codec that was approved by the Third Generation Partnership Project (3GPP) for multimedia applications.
متن کاملOn the utilization of overshoot effects in low-delay audio coding
In the speech coding community a low-delay coding usually means that the coding delay should be less than 5 ms, e.g, 2 ms in ITU G.728 [2]. It is reasonable to adopt this rule also to low-delay wideband audio coding. An acoustic signal propagates in air less than one meter within 2 ms and therefore this low coding delay should not produce significant audible echo problems for example in high qu...
متن کاملA WIDEBAND CELP CODER AT 16 kbit/s FOR REAL TIME APPLICATIONS
Since its introduction in 1984, Code Excited Linear Predictive (CELP) [1] coding has received considerable attention for high quality speech coding at low bit-rates. Although most of the research has been focused on coding of narrowband (200-3400 kHz) speech, some recent studies on CELP coding of wideband (50-7000 kHz) speech have been reported [2], [3], [4]. A possible application for wideband...
متن کامل